Tag

#mathematical reasoning

3 articles

Mistral's open-source Leanstral 1.5 aces formal math benchmarks and catches real bugs in code

Learn how Mistral AI's Leanstral 1.5 model combines formal verification with AI to solve mathematical problems and find real bugs in open-source code.

Jul 3

New math benchmark reveals AI models confidently solve problems that have no solution

A new benchmark shows that AI models confidently produce answers to math problems that have no solution, exposing a key gap in their reasoning capabilities.

May 1755

Our First Proof submissions

OpenAI shares proof attempts from its AI model tackling expert-level mathematical problems in the First Proof challenge, showcasing advanced reasoning capabilities.

Feb 2385